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1. Introduction 

In [I| some connections were made between string theoretic models of black holes and 
certain issues in arithmetic. One of the more speculative suggestions in ]]J was a proposal 
that the entropy of BPS black holes is related to an arithmetic height of certain arithmetic 
varieties. In the present note we remark on some connections between these speculations 
and some issues related to the Riemann hypothesis. In particular, the speculations seem 
most relevant to the question of "Landau-Siegel zeroes," which are hypothetical zeroes of 
L-functions very close to s = 1. (A precise definition is given in definition 4.4.) 

In section 2 we review the theory of Strominger-Vafa ||. In section 3 we summarize, 
reformulate, and sharpen the statements from |1[ whose implications we wish to explore. 
In section 4 we provide some background information on analytic number theory and 
Dirichlet L-functions, and in section 5 we see how everything fits together. In particular, we 
discuss a close interplay between the Strominger-Vafa prediction about black hole entropy, 
the Landau-Siegel zero, and the minimal discriminant of an elliptic curve with complex 
multiplication. 

Warnings: This note is written for a mixed audience of both string theorists and 
analytic number theorists, so some trivial things are explained. We caution the reader at 
the outset that the evidence for the height conjecture of [I] is slim, to say the least. Thus, 
this paper should be regarded as an exercise in Pascal's Wager. 

2. Summary of the attractor mechanism and the Strominger-Vafa proposal 

Let X be a compact Calabi-Yau 3-fold, and let 7 G H 3 (X; It) be an integral homology 
class. String theory associates two interesting mathematical objects to the pair (X, 7) : 

a. ) A finite-dimensional Hermitian vector space 7i(7). 

b. ) Another Calabi-Yau X 1 , in the same complex structure moduli space as X. 

We can interpret (a) and (b) both mathematically and physically. The physical setting 
is the theory of BPS black holes in d = 4, M = 2 compactifications of type II string on a 
Calabi-Yau manifold X. We now explain (a) and (b) in a little more detail. 
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First consider (a). In physics is the space of BPS states of charge 7. The 

definition of this space has not been completely rigorously formulated mathematically, 
although this should be possible using the theory of .D-branes. Very roughly speaking 
7^(7) should be defined mathematically as follows. Consider the moduli space M.{~i) of 
pairs (£, A) where E is a smooth special Lagrangian submanifold of X in the homology 
class 7, and A is a flat U(l) connection on S. The moduli space M.{^) inherits a metric 
from the Calabi-Yau metric on Y, and 7i(7) is the L 2 -cohomology of @ 

Now consider (b). By Yau's theorem, a Calabi-Yau manifold may be specified by its 
complex structure and its Kahler class. X will belong to a family of Calabi-Yau manifolds 
with complex structures in A4 cp i x . (For what follows we will need to work on the universal 
cover Ai C pix-) The map (b) is provided by the "attractor mechanism" of Ferrara, Kallosh, 
and Strominger @] |§ 0- For each 7 there is a dynamical system on the moduli space M. C pix 
and, for suitable 7, i the dynamical system will have a unique fixed point X 1 . Associated 
with the Calabi-Yau X 1 is the normalized period 

1 r a. 12 

J 7 

fx~, ^7 ^ ^7 I 



1^(7) I 2 := , r ' J7 " 7 '^ , (2-1) 



where 7 is a nowhere vanishing holomorphic (3, 0) form on X 1 . 

The connection between objects (a) and (b) is provided by the Strominger- Vafa pro- 
posal for the microstates accounting of black hole entropy . The idea may be summarized 
in the following four steps: 
1. Given a Calabi-Yau X , one may write a system of partial differential equations for 
a Minkowski-signature 4-manifold M equipped with certain geometrical data (e.g. a 
rank ^b^{X) torus bundle with connection). These equations generalize the Einstein 
equations on M and are called the supergravity equations. A choice of charge vector 
7 is equivalent to a choice of boundary conditions for these equations. For appropriate 
vectors 7 (those for which the normalized period has an isolated minimum in M. C plx), 
the supergravity equations admit black hole solutions #(7). Using the supergravity 



1 In the mirror formulation 7 would specify the Chern classes of a coherent sheaf in the mirror 
Calabi-Yau X, and At (7) would be the L 2 cohomology of the moduli space of such sheaves. 

2 Some discussion of which 7 are suitable is given in JTJ. 



equations one may compute the horizon area A (7) of the black hole solution -6(7). It 
turns out that 

A( 7 ) = 4vr|Z(7)| 2 . (2.2) 

We work in Planck units £ p i a nck = 1- 

2. According to Bekenstein and Hawking a black hole is a thermodynamical object. It 
has a temperature and an entropy, and the latter is given by: 

Ssu gr a(7) = (2-3) 

3. According to Boltzmann and Planck, we have the exact formula: 

S micro ( 7 ) = Hog[W( 7 )] (2.4) 

where Wi^f) is the dimension of the space of available states in the micro canonical 
ensemble specified by the charges 7 and the energy M = ^(7) |. Here k is Boltzmann's 
constant; we henceforth choose units with k = 1. 

4. According to Strominger and Vafa, for charge vectors 7 which are in some sense large, 
physical states formed by BPS configurations of D3-branes in X with charge 7 can 
be described macroscopically, in a supergravity approximation, by the black hole so- 
lutions $(7) to the supergravity equations. From the microscopic, D-brane viewpoint 
we identify W = dim 7i (7), for all 7, large or small. The correspondence between 
configurations of .D-branes and the black hole solutions is formulated mathematically 
as the statement that for large 7 

Smicro(7) ~ 5's U gra(7)- ( 2 - 5 ) 

Putting all the above statements together one arrives at the mathematically startling 
proposal that: 

log[dimft(7)] ~ 7r|Z( 7 )| 2 (2.6) 

for large 7. Both sides of equation ( |2.6|) are susceptible of precise mathematical definition. 
Moreover, such a statement would be a deep and surprising mathematical fact. Before 
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that becomes a reality, several clarifications of the meaning of (|2.6| ) must be carried out. 

In particular we need: 

1. A precise definition of TC("y) and a definition of dim TC("y) (since 7i(7) is a graded vector 
space, we might well have to use a graded dimension.) 

2. A precise statement of the meaning of "large 7." 

3. A precise statement of the meaning of the asymptotic symbol ~. 

It is quite necessary to take a limit of "large 7," in order to justify the supergravity 
approximation to string theory, and thereby the connection to black holes. We will take it 
to mean operationally that we consider sequences 7 n of charges such that z n := |Z(7 n )| — > 
00. We refer to such a sequence of charges as a big sequence. 

Another subtlety which we would like to mention is the concept of effective asymp- 
totics. For example, one could make a statement that a certain quantity is bounded, but 
without being able to determine anything about the nature of the bound or even being 
able to begin computing it. We will assume that the limiting behaviors of the physical 
quantities are effective: that is, any time a constant (implicit or named) is asserted to 
exist, one could furthermore explicitly compute or name such a constant. 

2.1. Versions of the Strominger-Vafa conjecture 

Point 3 above (below fl2.6| )) has not really been addressed in the literature. The 
Strominger-Vafa conjecture (^]6|) in fact admits several inequivalent formulations. Two 
plausible versions of the conjecture are: 

SSV Conjecture. {Strong SV conjecture): If {7 n } is a big sequence then 5 , m icro(7) has 
an asymptotic expansion in z n := |Z(7 n )|. More precisely, 



where we use O-notation in the precise sense of asymptotics of sequences in n, and S (a 
quantity discussed below) is positive. 

This is an expectation based on the supergravity approach to black hole entropy for 
CY 3-fold black holes. See, for examples, the discussions in 00|§. The corrections 
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(2.7) 



arise from both M-theory corrections to the 11D supergravity and from quantum effects 
associated with the compactification. The quantity 5 in fl2.7Q depends on details of the 
sequence 7 n , e.g., on the details of the directions of the 7 n in H 3 (X; 7£) as z n goes to infinity. 
The SSV conjecture might appear to be obvious from the point of view of supergravity. 
However, a systematic scheme for calculating corrections to the leading order supergravity 
approximation to M-theory is unkown. Moreover, it is known that supergravity, which 
treats charges as continuous, can miss subtle arithmetic properties. (For example, the 
number of ^/-duality classes of 7's with a fixed value of 1^(7) | can be 1 in supergravity, 
but in fact is given by class numbers in the exact formulation |]J.) The SSV conjecture is 
also suggested by the proposed formulae for exact results on 5 , m icro(7) in the case when 
X = S x E with S a K3 surface and E an elliptic curve |J . Using the Cardy formula i one 
can justify the SSV conjecture for this special class of Calabi-Yau 3-folds. However, the 
existing proposals, based on elliptic genera of symmetric products are only firmly justified 
for 5D black holes, and the extension to 4D black holes - even for compactification on a 
six-dimensional torus - is nontrivial. Moreover, the generalization of the existing proposals 
for dim 7-^(7) to generic Calabi-Yau 3-folds will be much more subtle and intricate. 
For all these reasons, we are also willing to entertain the 

WSV Conjecture. {Weak SV conjecture): If {7 n } is a big sequence then 

,. log5' m icro(7n) 

llm 1 2 

n^oo log 

Of course, the SSV conjecture implies the WSV conjecture, but the converse is false. 
The motivation behind the formulation of the WSV conjecture is the following. It might 
be that for large 7, the microscopic entropy actually behaves like 

Smicro(7n) ~ TTZ^ (log Z n ) ° , (2.9) 

where a is a random variable, chosen using a measure on H^{X;~Z^). Presumably the 
distribution would become more sharply concentrated around a = as 7 becomes larger. 

3 The Cardy formula is a generalization of the Hardy- Ramanuj an formula for the asymptotics 
of the partition function and gives the asymptotics of conformal field theory partition functions. 
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(2.8) 



As far as we know there is nothing wrong with this idea, and from some viewpoints it even 
seems likely. For example, the existence of BPS states can depend on arithmetic properties 
of 7. So there might well be a large amount of "scatter" and "noise" in the behavior of 
dim 7^(7) as 7 — > 00. (See Fig. 1 in section 5 below for an illustration of the kind of scatter 
we mean.) In this case the Bekenstein-Hawking formula would not be the leading term in 
a systematic expansion, but would only hold in some average sense. 

If the dimensions dim 7^(7) indeed behave in the way just suggested, then it is quite 
possible that the limit ( |2.8| ) does not exist in the standard sense. In such a situation it 
is more appropriate to use the lim-sup, (also denoted as lim) which always exists for any 
sequence of real numbers. I Thus we could replace the limits in the WSV conjecture by 
lim or lim. We refer to these as the WSV and WSV conjectures, respectively. 

There are yet other inequivalent formulations of the SV conjecture. 

3. The height conjecture 

3.1. Sharpening the height conjecture 

The main conjectures of [|l| (conjectures 8.2.1, 8.2.2, and 8.2.3) posit that the attractor 
variety X 1 can be defined as an arithmetic variety over some number field K 1 . This was 
verified in |l[ in some special cases, for example when X is a free quotient of K3 x E 
(where E is an elliptic curve), or when X is a complex torus. In a much more speculative 
section, also suggested a possible connection between the Faltings height for a metrized 
line bundle ht(X 7 ; K^) and the entropy. We want to investigate the consequences of these 
latter conjectures. 

We begin by making the "height conjecture" more precise. The first point to note 
is that if an attractor variety X 1 indeed satisfies the attractor conjectures, then it might 
nevertheless admit several different arithmetic "models" with quite different arithmetic 
properties. First of all, changing the field of definition of X can alter the arithmetic 

4 We recall the definition. If a n is any sequence of real numbers let b n = {a n , a ra +i, . . .}. Then 
let c n := sup b n be the smallest constant bounding b n from above. Plainly, the c n form a strictly 
decreasing sequence of real numbers. We define lim a n = lim c n . 
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properties. We will not investigate the issues of "base change" systematically, although we 
note that the Faltings height does stabilize under field extension |H| . A second ambiguity, 
of more direct relevance to what follows, is that two varieties X, X' over a field K might 
be isomorphic as varieties over an algebraic closure K (such as K = (D for a number field), 
but fail to be isomorphic as varieties over K. We will quote some relevant examples in 
section 3.3 below. 

As with the SV conjecture, there are various inequivalent precise formulations of the 
height conjecture: 

Strong Height (SH) Conjecture: Assume a family of arithmetic attractor varieties 
(X ln ,K ln ) corresponds to a big sequence, that is, z n := \Z(^ n )\ — > oo. Then there exists 
a finite positive constant k (possibly depending on the family) and a choice of model for 
X ln such that if ht(AT 7 ; K^) is the Faltings height for the metrized line bundle provided 
by the Calabi-Yau data, then 

exp[-ht(X 7 „;if 7 J] = S micro ( 7n ) +o(|z n | 2 ). (3.1) 

Alternatively, with the same hypotheses, we can formulate the 

Weak Height (WH) Conjecture: 

nm ht(AV;if 7 J _ r 

n^oo log[5' m i cro (7 n )] 

Of course, since not much is known about either numerator or denominator in ( |3.2|) , 



it is prudent to allow alternative formulations WH, and WH of the conjectures, with 
constants k and k, respectively. If the WH or SH conjectures turn out to be true, then the 
extent to which the constant k depends on the family of charges will become an interesting 
question, as will become apparent in section five. 

Let us return to the ambiguities of base-change and choice of model for X 1 . Regarding 
base change, our hope is that the choice of base field, while necessary to define the heights, 
should do no more than change the value of the constant k in the height conjectures. 

Regarding the choice of model, the attractor equation of supergravity only specifies 
the attractor variety X 1 as a variety over (D. If X 1 can be defined over a number field 



Kj, then since there can be inequivalent models over we must ask if the choice of 
model has any relevance for the physics of string compactification on X. A physicist's 
natural reaction to this question would be that the choice of model should be irrelevant, 
by general covariance. i However, as seen in PJ, the arithmetic of the number fields 
associated to attractor varieties is related to such physical quantities as the BPS mass 
spectrum. Whether or not the choice of arithmetic model is also of physical relevance 
remains to be seen. In fact, the height conjectures above are the first instance, of which 
we are aware, in which such a choice really matters. This raises the interesting question 
of whether physics indeed selects a distinguished arithmetic model, i 

3.2. A special class of attractor varieties 

Some weak evidence for the height conjecture was given in [|l] in the case when X is 
a complex torus. In this case it was shown that X^ is isogenous to a product of 3 elliptic 
curves with complex multiplication: 



X 1 ~ 



£r( 7 ) X -EV( 7 ) X £y( 7 ) (3.3) 



where E T : = (D/(Z + rZ) and t(j) = i^/T^y). Here ^4(7) is an integral quartic polynomial 
on Hs(X; 7L) related to the quartic Eyj invariant (see [|l| for precise definitions). Moreover, 
one easily verifies in this example that: 



l^(7)| 2 = Vhhl (3-4) 



while the supergravity analysis shows that indeed 5 , SU gra(7) = jl^] ||13|| []H 



Equation (|3.3|) is only a statement about isogeny classes. In order to estimate the 



height it is more convenient (but not absolutely necessary) to pin down the attractor 



5 Roughly speaking, general covariance refers to invariance of a theory of gravity under C°° 
diffeomorphisms. A blind application of this principle would also suggest that complex structure 
is physically irrelevant (which is hardly the case). The main thesis of this exploratory paper is 
that not only complex structure, but even arithmetic structure is physically significant. 

6 A good place to start thinking about this might be Witten's linear sigma model formulation 
of CY sigma models, where a definite choice of projective model (albeit over C) is made by the 



quantum field theory |11|. 
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variety exactly. We will eliminate the unkown isogeny by choosing "diagonal charges." 
That is, using equations (6.2) — (6.5) of 0], with P lJ = p l S l ^Qij = qidij in the notation 
of that paper, we may choose lattice vectors 7 G H 3 (X;^.) depending on eight integers 
r, p 1 ,]? 2 ,]? 3 , s, q±, q2, and qs such that we have the equality 

X 1 = E T1 x E T2 x E T3 , (3.5) 

where now we have 

33 3 
7 4 (7) = 2 [($>^) 2 " X)(P*?i) 2 ] - (rs + J^P 1 ^ 2 + 4 K^ 3 - spVp 3 ) (3.6) 

i=l i=l i=l 

and, defining D = —I4 < 0, n are given by: 

2(p 1 q 1 - p 2 q 2 - p 3 q 3 - rs) + VD 
Tl = j / o 3 1 \ ( d - 7 ) 

with cyclic permutations on 123 giving the formulae for T2, T3. 

Now let us discuss some arithmetic associated with the elliptic curves E Ti . In general, 
if r is of the form (—b + \^D)/2a for integral a,b,D, with I? < 0, then it follows that 
TL + t7L is a proper fractional ideal for some order O of the field Kd := Q(v^D). To be 
more precise, let the minimal polynomial of r be At 2 + Br + C = where A, 5, C are 
integral. (Note: it need not be true that a = A, or b = B.) Then by |I5 ], Lemma 7.5, 



2 + tTL is a proper fractional ideal for the order O = 7L + At7L of Kd- In general, the 
conductor of the order O will be larger than one. By [jj5 |. Theorem 11.1, the value of the 



modular function j(r) generates the ring class field of this order, i.e., the ring class field 
is K£){j{r)). It follows from ( |3.5| ) that X is at least defined over the compositum of three 
such ring class fields (it might well be defined over a smaller field). 

We would like to eliminate some of the complications of general ring class fields and 
work instead with the Hilbert class field Kd of Ke>. This will simplify the height compu- 



tation below. As explained at length in [|15| , the distinct ideal classes of the ring of integers 
O(Kd) of Kd have representatives 2 + r^Z. Here k = 1, . . . , h(D), and h(D) is the class 
number of Ke>. The complex numbers have the form = (— b^ + \^D)/2a,k and are the 
solutions in the upper half-plane to a^r 2 + bkT + Ck = where a^x 2 + bkxy + c^y 1 runs over 
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representatives of inequivalent primitive binary quadratic forms of discriminant D. We can 
choose representatives in the standard "keyhole" fundamental domain for PSL(2, Z). We 
will refer to the as "Heegner points." One of the beautiful statements of the theory of 
complex multiplication is that for any k, j(rk) is an algebraic integer and Kd = KoijiTk))- 
Now let us consider some explicit charge vectors 7. First we take p = p 1 = p 2 = p 3 , 
and q = qi = q2 = Q3 so that we only have to work with a single elliptic curve with modular 
parameter r = 7*1 = T2 = T3 given by 

-2(pq + rs) + VD 



4(p 2 + rq) 

Here D reduces to — J4, with I4 given by 



(3A 



Ii(p, q, r, s) = 12p 2 q 2 + 4(rq 3 - sp 3 ) - (3pq + rs) 2 . (3.9) 

Next we choose p, q, r, s so that r is one of the Heegner points. We will content ourselves 
with finding charge vectors 7d which yield the principal class (the trivial class). This is 
given by 

r = (3.10) 

if D = 1 mod 4 and r = \[D if D = mod 4. Specifically, if we choose r = and 
s = 3 + 4_D, then with p = q = 4, we get (|3.10|). If p = 1, g = 2, we get 



r = -l + v A D, (3.11) 



which is clearly PSL(2, Z)-equivalent to r = vD. 

Presumably, other charges 7 can lead to other ideal classes in Kd, but we will focus 
on the above sequence of charges in this paper, and will call them 7^1 . As explained in 
the next section, we will take as the field of definition of the special attractor varieties 
associated with the charges 7^. 

3. 3. Silverman's formula for the height of an arithmetic elliptic curve 

Now that we have focused on the attractors X lD let us compute their height. Since 
X lD is a product of 3 elliptic curves we have 

hX{X lD1 K D ) = 3ht(E T{lD y,K D ), (3.12) 
10 



and therefore we need only compute the height of an elliptic curve. This has been done in 
a very explicit way by Silverman in chapter 10 of ||10|1 . We now review his formula. In the 



next section we apply the formula to our considerations. 

To begin, we must review a few standard definitions. (See any textbook on elliptic 



curves, for examples [ 16|fl7 l ||l8|] .) Let K be a field. An elliptic curve E/K can be given 
by a Weierstrass model 

y 2 + a\xy + a^y = x 3 + a 2 x 2 + a^x + (3.13) 

where G K. Two equivalent Weierstrass models for the same curve over K are related 
by a change of coordinates 



x' = u 2 x + r 



y' = u 3 y + su 2 x + t 



(3.14) 



with u G K* , r, s, t G K. 



We introduce the discriminant A through the standard definitions: 
b 2 := a\ + 4a 2 

64 := a±as + 

b 6 := al + 4a 6 

b% := a 2 a,Q — 010304 + 4o20g + 0203 — 04 (3.15) 

c 4 := b\ - 246 4 

c 6 := -b\ + 36b 2 b 4 - 2166 6 

A := -blb 8 - 8b 3 4 - 27b 2 6 + 9b 2 b 4 b 6 
If the elliptic curve is nonsingular then A ^ and we can define the j-invariant: 

j := cl/A. (3.16) 

When the characteristic is not 2 or 3 it is useful to note that 2 6 3 3 A = c\ — c|. 



Under the change of variables ( |3.14| ) we have: 



/ 4 

C 4 = U C4 
1 6 

c G = u c 6 



A' = u 12 A 



f = J 



(3.17) 



11 



Curves with the same j-invariant are isomorphic over the algebraic closure of K, but need 
not be isomorphic over K. 

Now let K be a number field and E an elliptic curve over K. The field K has a 
set of valuations. These consist of archimedean valuations ("the places at infinity") and 
nonarchimedean valuations ("the finite places"). The places at infinity correspond to 
inequivalent embeddings of K into IR or C. In our example below all the embeddings will 
be complex embeddings ipi : K C, so we henceforth assume this is the case for K. Under 
these embeddings E(C) will be isomorphic to C/(Z +Tj2). The finite places correspond to 
valuations labelled by the different prime ideals in K. Let p be a prime ideal in K. We 
can then consider the curve E p over the p-adic completion K p which is just given by the 
same equation for E as before, but now considered over this larger field. Let (A) p be the 
discriminant of E p (this is an ideal in 0(K P )). A minimal model for E p (at p) is obtained 
by making changes of coordinates of the form ( |3. 14j) (with the field now taken to be K p ) 
such that the power of p dividing (A) p is the smallest possible. For the minimal model 
let us say p np |(A) p , but p n p +1 /f(A) p . In particular, if E is smooth at p then n p = 0, 
and n p > only at those primes which divide (A) in the "global model" of E. Taking the 
product over all the finite places of K defines the minimal discriminant of E as an ideal 
in K: 

VE/K:=Hp np . (3.18) 
p 

We can now state Silverman's formula for the height of E/K (Proposition 1.1, p. 254 
in |TD[ . ) @ The formula for the height involves the sum of the contributions from the finite 
and infinite places: 



12[K : QME;K) = log \N K/ ^(V E/K )\ - 2^ log 



;^) 6 |r 7 24 (r l )|(2 7 r)- 12 



(3.19) 



where is the norm of the ideal, the second term is the sum over inequivalent complex 

embeddings of K, and tj(t) is the Dedekind function. 



7 We will not define the height. See, for examples, [lSl |fL0|[^0| for definitions. For expository 



discussions see [ 21 1 [ 22 1 . Curiously, the Faltings height has appeared before in string theory. See, 



for example, [23 1 , and related papers. We do not understand any connection to this work. 
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Now, to apply (|3.19|) we must choose a field of definition and a Weierstrass model 
for our attractor varieties with r given by (|3.10|) or ( |3.11| ). This will introduce some 
arbitrariness into our discussion. We motivate our choice as follows. If j 7^ 0, (12) 3 then 
one can always write a model for an elliptic curve over Q(j) with invariant j by taking: 

27? 

V 2 = 4x 3 - j-±^ {x + 1). (3.20) 

Thus, one obvious model for the attractor varieties is obtained by making a transformation 
of the form (|3.14[) to get 

y 2 = 4x 3 - g 2 x - g 3 
g 2 = 27j(j-(12) 3 ) 

(3.21) 

^ 3 = 27j(i-(12) 3 ) 2 

A = ^ 3 -27^ 2 = 2 6 -3 12 -j 2 (j-(12) 3 ) 3 . 

Note that we are taking the coefficients in the ring of integers O-g . Note too that the 
Hilbert class field is not the minimal field of definition of this curve. We could take Q(j (r)). 
However, this field has many different conjugates inside the Hilbert class field, depending 
on which Heegner point r is taken. Thus we find the Hilbert class field more natural. 

It is important to note that ( |3.21[ ) is not the only Weierstrass model we could choose. 
We can illustrate the kinds of ambiguities we face in the following two simple sets of 

examples. Consider two families of elliptic curves over Q labelled by an integer n 

2 s 
y = x + n 

(3.22) 

y 2 = x 3 + nx. 

The first family has j = 0, C4 = 0, cq = — 2 5 3 3 n, A = — 2 4 3 3 n 2 and complex multiplication 
by the second family has j = 2 6 3 3 , C4 = — 2 3 3n, cq = 0, A = — 2 6 n 3 and complex 

multiplication by ~E\\J — 1]. Nevertheless, if we consider two curves for n±, n 2 then, if m/n2 
is not a sixth power in the first example or a fourth power in the second example, the 
curves are inequivalent over Q and only become equivalent over extensions of degree 6 and 
4, respectively. Moreover, the minimal discriminants depend on the choice of n. Less 
trivial examples of inequivalent Weierstrass models (with j 7^ 0, 1728) can be gleaned from 
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3.4- Bounding the height 

We now specialize Silverman's formula further and put a bound on the height so that 
we can test the height conjecture. 

In order to specialize Silverman's formula, we note that the values of the j-function 
at the different Heegner points define the distinct embeddings of ■— ► C, so we may 
rewrite Silverman's formula for the height of ( |3.21|) as 

h(D) 

24h(D)h(E,/K D ) = log|iV^ /Q (P -J + £ 

fc=i (3.23) 



K{r) := -2 log 



(3t)V 4 (t)|(27t)- 12 



The sum in ( |3.23|) is over representatives for the ideal class group. In general we will write: 

h(D) 

(f(r))n := ^ E fM ( 3 - 24 ) 

for any function /. For /(r) = 7Z(t) we denote R(D) := (TZ(t))d- Thus, for our special 
charges jd 

R ( D ) + e ' Kd . (3.25) 



ht(X lD ;K D ) = l 



h(D) 

The term R{D) in ( |3.25| ) only depends on 7 through the £7,7 invariant D = —I^'j). As 
we have discussed above, the minimal discriminant can depend on the choice of Heegner 
point defining the curve ( |3.21j ), and even on the choice of i^-isomorphism class for the 
curve. We have, somewhat arbitrarily, chosen ( 3-10|) (|3.11 ) corresponding to the principal 



class and moreover have chosen the Weierstrass model Q3.21| ). 

Now we can put some useful bounds on the height. The minimal discriminant is a 
subtle object and is hard to estimate, even for a CM curve. For some CM curves its norm 
can be as small as 1 (over F = Q(j) under certain conditions on D - see ||25|| ). In ||26|| it 



is calculated for an interesting family of elliptic curves. It is quite possible that further 
results for the model ( |3.21| ) can be obtained from the deep work of Gross and Zagier [ 2~T|j . 



Nevertheless, while it is subtle, we do know that in the model ( |3.21| ) it is a certain integral 



ideal in Kd which divides the principal ideal (A) explicitly calculated in (|3.21| ). From this 
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we may derive some easy inequalities, as in [flj: 

^R(D) < ht(X^ D ,K D ) < ^R(D) + (f(r)) D (3.26) 

where 

/(r) = 61og[(3r) 4 |E 4 (r)| 2 ] + 61og[(3T) 6 |£ 6 (r)| 2 ] + const. (3.27) 

The first inequality in ( |3.26|) does not depend on the choice of Weierstrass model, but the 
second does.. 

The function /(r) grows like 601og(Sr) for r — > zoo and is therefore square integrable 



in the Poincare measure. It follows from a theorem of W. Duke |28| that the average 
converges to 

</(t)>*-- / ^-f(x + iy) (3.28) 
as -D — > — oo. In particular, the integral and hence the limit is finite. 



4. Summary of some analytic number theory 

In sections 4.1 and 4.2 we summarize some well-known facts and definitions from 
analytic number theory. In section 4.3 we summarize some more technical facts needed in 
section 5. 

4-1- L -functions 

Definition 4.1: Legendre-Jacobi-Kronecker symbol. This is the unique, real, 
nontrivial Dirichlet character of modulus D. Its value for n is denoted (^)- 

The LJK symbol (f ) can be computed for D, n ^ as follows. First of all, it is 
completely multiplicative in both arguments: 

(^)H^(^) (—) = (-)(-) I") 

v n n n n±n2 n\ n<i 

Thus it suffices to give its value for n = — 1 and for n prime: 
1. If n = — 1 then 



(^-)=+l D>0 



(4.2) 



-1 D < 0. 
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2. If 77, = 2 then 

(— ) =0 D even 

V 2 ; (4.3) 
= ( _l)(^ 2 -i)/8 D odd . 

3. Finally, if n = p is an odd prime then (^) is the Legendre symbol, i.e. 

(— ) =0 if D = mod p 

= +1 z/ D = x 2 mod p, for some x^O (4-4) 

= — 1 z/ x 2 mod p, for any a; 

One can check that if D = 0, lmod 4, (as in this paper), (^) only depends on the 
residue class n mod 

Definition 4.2: Fundamental Discriminants D < is called a fundamental dis- 
criminant if it is the product of relatively prime factors of the form —4, 8, —8, or 

(-l)(p-i)/2p, p > 3 . 

Equivalently, D < satisfies either (a.) D = 1 mod 4 and D is squarefree, or (b.) 
D = mod 4 , D/4 ^ 1 mod 4, and D/A is squarefree. These are the discriminants of 
quadratic imaginary fields. 

Definition 4.3: L-functions L-functions of conductor D are by definition the infinite 
series 

oo D 

L(s,D):=J2(-)n- s . (4.5) 



n 

n=l 



For example, choosing D = 1 we get the Riemann £ function. Other examples of 
L(s, D) are: 

°° / 1 1 \ 

L(S ' ~ 3) = 5 V(3n + l) s " (3n + 2)-J 



n=0 
oo 



/ 1 1 \ 



n=0 

OO 



(4.6) 



+ 



1 1 

) s ' (7n + 2) s ~ (7n + 3) f 

i 1 1 



n=0 V V y 



(7n + 4) s (7n + 5) s (7n + 6) i 
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The analytic properties of L(s,D) are well known [2S]. The series is absolutely con- 
vergent for Re(s) > 1 and admits an analytic continuation as an entire function of s (for 
D^l). Moreover, for D < we may define 



Z(a, D) := (q/*)W*r(( 8 + l)/2)L(a, D). (4.7) 

Here g is a positive integer defined to be the minimal period in n of the function (— ). 
In general it is a positive integer dividing D, but in our case where D is a fundamental 
discriminant, we have q = \D\. That is, the character f^J is primitive with period \D\ 
exactly when D is a fundamental discriminant. 

It can be shown that £(s, D) is an entire function of complex order one. Moreover the 
zeroes of £ (s, D) come in complex conjugate pairs. Thus £(s, D) has a product formula i 

^D)=e A Y[{l--). (4.8) 
p P 

The completed function £(s, L>) has the functional equation 

= (4.9) 

which is valid only for primitive characters (q = \D\). Otherwise one must make a mod- 
ification to account for prime factors of \D\/q. The zeroes p, called the critical zeroes of 
L(s, -D), lie in the critical strip < Re(s) < 1. 

It is known that s = 1 is not a zero for any D, and in fact 

(We will recall a proof of this below.) Here w(D) is the order of the group of units in Kd\ 
for D < —4 we have w(D) = 2. 



8 This notation is a little sloppy because one should include a convergence factor for the infinite 
product. Alternatively, one can eliminate this by grouping the factors with Qp ^ into complex- 
conjugate pairs (since the coefficients of the Dirichlet series are real.) 
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4-2. The zeroes of L(s,D) 

The LJK symbol is completely multiplicative as a function of n. Combining this with 
the prime factorization of integers one can write another product formula 

)p~ s ) 

V 



L(s,D)= J] (l-(-)p- s ) (4.11) 



p prime 



for the function L(s, D). This is analogous to Euler's product formula for C( s )- While it is 
not immediately obvious, the product formula ( |4.11| ) for L(s, D) encodes the structure of 
primes in the field Kp. Thus, comparing the two product formulae (|4.8|) and ( f4.11| ) gives 
information about primes. Accordingly, the nature of the zeroes of L(s, D) are of interest 
in understanding the arithmetic of Kb- 

This astute remark leads to the central problem of analytic number theory, the gen- 
eralized Riemann hypothesis. It is "generalized" because we are considering the family of 
functions L(s, D); henceforth we use the abbreviation GRH. i A formulation of the GRH, 
suitable for our present purposes, is the conjecture that the critical zeros of L(s, D) all 
lie on the line Re(s) = \. The GRH is of course very difficult, so various sub-problems 
have been studied. One of these concerns the possible existence of critical zeroes near 
s = 1. These are known as "Landau-Siegel zeroes" (LSZ's). Their existence would falsify 
the GRH. More precisely: 

Definition 4.4: Landau-Siegel zeroes. Choose a constant c > 0. A " Landau-Siegel 
zero for c, D" is a real zero s = (3 of L(s, D) with 

1 - — ^— < < 1. (4.12) 
log \D\ 

See [p9j (chapter 21) for equivalent formulations of the Landau-Siegel zero. 

It is possible to explicitly compute an effective constant C2 > such that the neigh- 
borhood of radius C2/log \D\ about the point s = 1 contains at most one zero of L(s, D). 
This rules out complex zeroes in this neighborhood since they come in conjugate pairs. 



9 In this paper GRH always stands for the hypothesis for Dirichlet L-functions, not for elliptic 
curve L-functions. 
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However, the important challenge remains to eliminate the real zeroes. That is, to find an 
effective constant c\ such that there are no Landau-Siegel zeroes for c\ and any D, whether 
positive or negative. In this paper we only study D < 0. 



4-3. The large \D\ behavior of L(l, D) and the zeroes of L(s,D) 

In this section we explain how the zeroes of L(s,D) are related to the large \D\ 
behavior of 

w - (4 ' 13) 

where the derivative is with respect to s. We will begin with some heuristic remarks, and 
conclude with some precise estimates (Theorems 4.1 and 4.2). These estimates will be 
useful in the next section. 

The relation between the "size" or rate of growth of \(D) as \D\ — > oo and the 
existence of zeroes near s = 1 follows from the key identity 

Us,D) = Uog(\D\/n) + 1 -i;((s + l)/2) + ^-(s,D)= £ -U (4.14) 

p|€(p,D)=0 



Here 



:= -f( (4.15) 
T(x) 



is the digamma function. Equation ( |4.14|) follows immediately from logarithmic differen- 
tiation in s of the product formula 



It follows from (Og) that h-(l,D) can grow rapidly with \D\ if L(s,D) has some 



zero p which is "close" (as a function of |Z)|) to the point s = 1. On the other hand, 
zeroes further away from s = 1 have less of an impact on the sum in ( f!.14|) ; their effect is 



governed by their "density of states." It is only the lower-lying zeroes which are important 
for the size of \(D). Therefore, if we can say \(D) is "small" then we will have checked a 
prediction of the GRH, at least in some neighborhood of s = 1. The smaller the bound on 
A(.D) the larger the zero-free region around s = 1. 

We will now explain a trivial bound on the growth of \(D) obtained from bounding 
the numerator and denominator separately. Our argument is sloppy and far from optimal, 
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yet it does explain in simple terms the overall reason why the Landau- Siegel zero is related 
to a small value of L(l, D). 

We first bound the numerator by showing there is a constant C such that 



\L'(1,D)\ <C(log|L>|) 2 . (4.16) 



We prove this as follows. By (|4.5 



n D 
L'(1,D) = - V(-)(logn)n- 1 - V (-)(logn)n- 1 . (4.17) 
z — ' n ^— ' n 

n=l n=\D\ + l 



The first sum is trivially 

\D\ 

(^(log^n" 1 ) = ^((loglDl) 2 ). (4.18) 

n=l 

The second sum can easily be bounded by (9 ((log |-D|) 2 ) using partial summation, since 
changes slowly and 

k+D ^ 

V(-) = 0,A; = 1,2,3,... (4.19) 
z — ' n 

n=fc 

(Roughly speaking, this means there is a lot of cancellation due to the oscillation of the 
Dirichlet character which modulates (logn)n -1 .) 

The bound L'(1,D) = 0((log\D\) 2 ) can be improved. Nevertheless, the intuition is 
that the fraction X(D) can be extremely large only if its denominator is extremely small. 
Therefore we now bound the denominator. A trivial bound comes from ( [4.10|) . Since 
h(D) > 1 it follows that 

1 ^AD\ JW\ 



L(1,D) -Kh(D) ~ n 
Of course, h(D) does grow with D so this estimate is far from optimal! 



10 Note that if h(D) /\D\ 1//2 is large there cannot be a zero near s = 1, while if h(D)/\D\ 1 ^ 2 
is small there might be a nearby zero. This is the essence of the connection to the class number 
problem. Indeed, the work of Goldfeld and Gross-Zagier provides better bounds on 1/L(1, D), 
reverses the logic of the present discussion, and proves a growth rate of h(D) for \D\ increasing. 
This provides a solution to Gauss' problem of finding which imaginary quadratic fields have a 
given fixed class number. 
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Taken together, ( [4.16 ) and (|4.20| ) already show that 

|A(D)| < C"(log \D\) 2 ^/\D\. (4.21) 
From (|4.14j ), the inequality ( [4 . 2 1| ) rules out zeroes at (roughly) a distance of order 

(logpin^- 172 

from s = 1. Of course, the sizes of L'(1,D) and L(1,D) are related to each other (by 
integration of L'(s,D) in s), but it is surprisingly difficult to improve on | £> | ^-factor m 
the trivial bound ( |4.21|) . 

Let us now see what the GRH has to say about the growth of X(D). First of all, 



Littlewood proved that the GRH implies 1/L(1,D) = 0(loglog |D|) as D -> -oo flgQfl. 
Moreover, he showed that the GRH implies 

|A(£>)|=0(loglog|£>|), (4.22) 

and hence, in particular, that the GRH forces 

lim 7^ = 0- (4.23) 

D^-oo log \ JJ\ 

In fact, one may produce sequences of D's such that X(D) grows roughly as loglog|£)|. 
All of this is evident from the following 



Theorem 4.1 [Miller - in the appendix to this paper, [3TJ]. Then 



X(D) 1 
lim sup — — > - 

d^-oo log log \D 2 

r KD) 1 
hminf ; — r < 

D^-oo log log \D\ 2 

where X(D) = L'(l, D)/L(l, D). These inequalities hold, in fact, if D — ► — oo along 
fundamental discriminants. 

Of course, the GRH might actually be false! Evidently, there is a good deal of room 
between the trivial bound (|4.21| ) and the consequence Q4.23] ) of the GRH. The elimination of 



Landau-Siegel zeroes is thought to constitute an important beachhead on the way towards 
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understanding the GRH. The following theorem gives a criterion to rule out the existence 
of Landau-Siegel zeroes: 



Theorem 4.2 Suppose D n is a sequence of fundamental discriminants with D n — > — oo. 
Suppose furthermore that the limit 

is finite. Then there exists a positive constant c, 

c= (l +su ,^\y (4.26) 



2 log|Z> 

such that there are no Landau-Siegel zeroes for c among the D n . 

Proof: Note that the zeroes p are either real or come in complex conjugate pairs. Writing 
p = (3 + Z7 for the real and imaginary parts we have: 

^ T^p~ = ^ (l - p) 2 + 7 2 (4 ' 27) 

and since the zeroes are in the critical strip, this is a sum of nonnegative terms. Thus, if 
p* = /3* is a real zero of L(s, -D) then 

1 < ^ 1 

= -log(|D|/7r) + -V(l) + A(D) 



where in the second line we have used (|4.14|) . Recall that i/j(x) is the digamma function 

(ME)- 

Since ( |4.27[ ) is positive and the constant — ^log7r + = —0.86 is negative it 

follows from (|4.28|) that 

2 log \D\ v ! 



for any D. Therefore, finiteness of (|4.25|) implies that 



sup M < oo, (4.30) 
log LDJ 
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Now we use 



\{D n ) < sup 



\MDn) 

loglLt 



log 



Then (f4.28|) shows that if /?* )Tl is a real zero of L(s, -D n ), then 

1 



1 /3*,r 



1 |A(Z)„) 

< ( - + SUP L±r£t 

2 log \D r 



log LD r 



(4.31) 



(4.32) 



Now comparing with the definition (|4.12| ) we see there is never a Landau- Siegel zero for 

\\(D n ) 



(c, D n ) for c = + sup 

An alternative formulation of this proof (which has the advantage of naming the 
constant c at the sacrifice of ignoring a finite number of cases) is to use the literal definition 
of the lim sup: for each e > one can find an N(e) such that 

\(D r 



for all n > N(e). Then 



1 



su Pm> 



1 



lOg \D r 



< 6 



<(^+S + e) logllX 



(4.33) 



(4.34) 



1 ft* , n 2 

for n > N(e), which leads to the conclusion that there is no Landau-Siegel zero for (c, D n ) 
for c = (| + S + e)" 1 and n > N(e). 4 
Remarks. 

1. Put differently, if there are c's — > such that each has some Landau-Siegel zero, then 
those discrimiants D n — >• — oo satisfy 

\(D r 



lim 



lOg \D r 



oo. 



2. The issue of effectiveness enters in the following ways. In the first argument, one 
might know 5 exactly but not know what sup j^~rpl is - we may know the limiting 
behavior of this sequence but not be able to bound how long it takes for this sequence 
to exhibit it. In the second argument, we may not be able to compute N(e) effectively. 



5. Black hole entropy and critical zeroes 

In this section we will explore come consequences for analytic number theory of the 
various conjectures of sections 2 and 3. 
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The main technical observation is that the height conjectures have implications for the 
behavior of L(s, D) thanks to an equation sometimes called the Chowla-Selberg formula 
||32j| . This is a consequence of class field theory and the Kronecker limit formula. We next 
recall the derivation of this formula. 

The derivation begins with the the nonholomorphic Eisenstein series 



E(t,s) 



V 



mr + n\ 2s 



(5.1) 



where y = 3r and the sum is over all pairs of integers (n, m) ^ (0, 0). The sum is absolutely 
convergent for Re(s) > 1, and admits a meromorphic continuation to the entire s-plane. 
The (first) Kronecker limit formula is the statement (see, e.g., 



p. 273) 



E(t,s) 



TV 



S- 1 



2ir log 



2e"V/2| ??(r) 



+ 0(s-l) 



(5.2) 



as s — > 1. Now, let be a set of representatives of the h(D) Heegner points as in section 
3. A simple application of class field theory shows that 



w(D)( l -^) s/2 ((s)L(s,D)= Y^E(r h s). 



h(D) 



(5.3) 



We now take s — > 1 and compare the two sides of the equation. Equating the residue of 
the pole leads to the standard result ( |4.1U| ). Comparing the constant terms leads to the 
Chowla-Selberg formula 



R(D) = 12 



log V-D + X(D) + log [87r 2 e - 7 ] 



(5.4) 



where R(D) is defined by (ggg)(ggg). 

We are now ready to combine the conjectures of sections two and three with the 
results from section four. The first point to make is that the SSV and SH conjectures 
are "probably" incompatible. Let D n — > — oo be a sequence of fundamental discriminants. 
Applying the SSV and SH conjectures to the family of attractors X lDn constructed in 
sections 3.2 and 3.3 we find 



U(X lDn ,K Dn ) = «log[7rV-£>n] +0( 



1 



log[y^D^ 

|zyi/2 '|d„|*/4« 



(5.5) 
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Fig. 1: A plot of A(.D) against \D\, as D runs through the negative prime funda- 
mental discriminants with \D\ < 2000. Unfortunately loglog|D| grows too slowly 
to see numerically the envelope predicted by Theorem 4.1. 



This implies some very interesting cancellation in ( |3.25|) for the following reason. Roughly 
speaking, Theorem 4.1 says that there exist real numbers a± with ct_ < < a+, such 
that A(.D) can actually grow like a± log(log \D\) for some sequence of D's By 



this means that R(D) has scatter as illustrated in fig. 1. For large \D\ the envelope is of 
width — a_) log(log \D\). Suppose, for the moment, that the minimal discriminant 
term in ( |3.25| ) were absent. Then using the SH conjecture we would conclude that k = 3/2 
and moreover, by choosing suitable sequences of discriminants D n , the charges r )r> n would 
produce black holes with entropies 5 , m i Cro (7D„) that actually grow with \D n \ like 

SUctoCtaJ ~ n\fWfr°%> \Dn\f (5.6) 

for various constants C in the range ol- < C < a+. Moreover, Theorem 4.1 asserts that 
there exist sequences {D n } which actually realize the extreme cases a±. 
Now let us restore the minimal discriminant term 

MP) ,5 ' 7) 



in (p. 25 ). As we have discussed, this depends on the choice of Heegner point and the Weier- 



strass model. It would be fascinating (though we think unlikely) if for general sequences 
of charges r yD n with D n — > — oo one could systematically choose Weierstrass models such 
that the minimal discriminant in ( |3.25|) fluctuates to match the changes in X(D n ) in the 
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way demanded by the SSV conjecture. Since we think it is highly unlikely, we conclude 
that SSV, and SH are probably incompatible, as stated above. We could be more precise 
if we had better information on the size of the minimal discriminant X> . 

hi^ I Kb 

Put differently, if one could choose families of attractor points with a bounded value 
of (|5.7| ) then the strong height conjecture implies that the distribution of 

log dim 7^(7) 



7T 



(5.f 



plotted against y/I^yj would have a lot of scatter, similar to the scatter in fig. 1, around 
the average value 1. 



Let us now assume WSV and WH (or WH). Then, from (p78|)( |3.2| ) and the first 
inequality in (|3.26| ) we get 

lim" < 8 «- ( 5 - 9 ) 
Then, from the Chowla-Selberg formula we get 

n log l^l 1 / 2 ~ 12 V ' 

Now, using Theorem 4.2 we arrive at the main statement in this paper: The WSV conjec- 
ture and WH conjecture together rule out Landau-Siegel zeroes for D < 0. More precisely, 
we have 



Theorem 5.1 The WSV conjecture and the WH conjecture together imply the following 
restrictions on Landau-Siegel zeroes: For any sequence {D n } of fundamental discriminants 
with D n — > —00, there exists a positive constant c such that there are no Landau- Seigel 
zeroes for (c, D n ). 

We would like to remark that a similar result (with log|D| replaced by |-D| e , for a 
positive constant e) was proven unconditionally by Tatuzawa in [|34]. Tatuzawa's theorem, 
while weaker, is sufficient for many applications. Two novelties of the present discussion 
are that we have log \D\ rather than a power, and, moreover, the constant c is in principle 



computable from physics (granted the WSV and WH conjectures). 
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Let us now turn the logic around and suppose that the correct configuration of con- 
jectures is WSV, WH, but that there are Landau-Siegel zeroes for D < (thus falsifying 
the GRH). Then there would be sequences of charges 7™ with D-brane configurations of 
anomalously large entropy compared to the Bekenstein-Hawking entropy. 

Even more ambitiously, we can turn things around and assume the "safest" set of 
conjectures: GRH, WSV, and WH, and see what they predict. From the GRH we have: 

R(D n ) 



lim 



°o logl-Dnl 1 / 2 



12 



(5.11) 



Now, the WSV and the WH conjecture imply information on the mysterious minimal 
discriminants of the elliptic curves (|3.21|) . From ( |3.26|) we get 



lim 



h{D n )\og{\D n \^) 



Ik -12, 



(5.12) 



and in particular, k > 3/2. Note that the second inequality in (|3.26| ) shows that k < 9. The 
second inequality depends on the choice of Weierstrass model (|3.21[) , so we are assuming 



our choice of models is suitable for the conjecture. Of course, replacing WH by WH we 
replace R by k and lim with lim and get a stronger "prediction." 

As we mentioned before, the mini- 



Curiously, some examples support (|5.12 



mal discriminant was computed in |26[ for a certain family of curves over Kd = _ p for 



p = 3 mod 4, and prime. Denoting these curves by A(p), Theorem 12.2.1 of [26| shows 
that D , = (— p 3 ), so the norm over Q is (—p 3 ) 2h and we thus get R = 3 for this 



family. On the other hand, there are also families of CM curves with /q(^ ) ) = ^ 1^1* 

Such a family would have R = 3/2. 



6. Conclusion 

The remarks of section five are built on a house of cards, namely, on a chain of con- 
jectures about some relations between D-branes, black holes, and number theory. The 
weakest link by far in the chain of conjectures is the relation between D-branes and arith- 
metic suggested in 0. Admittedly, it is a long shot. Nevertheless, as we have shown, 
it would have dramatic consequences if true. At worst, there are a couple of interesting 

27 



coincidences, so perhaps it deserves some closer scrutiny. It is curious that the choice of 
arithmetic model has some relevance for "predictions" such as (|5.12|) . Whether this turns 
out to be an interesting feature or a fatal flaw of our discussion remains to be seen. 
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Appendix A. : Large Values of 77(1, — p), by Stephen D. Miller 

In the above paper we made use of the fact that the generalized Riemann hypothesis 
(GRH) implies that 

A(£>)=0(loglog|£>|), 

where 

X(D) - L ' {1 > D) 
X{D) ~ L(1,D)- 

Also, we discussed the implications of how the log log rate is in fact optimal. The 



upper bound is well-known; a proof can be found in |35| , for example. The purpose of this 
appendix is to prove the lower bound: 

Theorem A(=4.1 above) Let X(D) = ^ and D = -p. As p — > 00 among the 
primes which are 3 modulo 4, 

X(D) 1 

hm sup - — — — > - 

loglog|.D| 2 
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and 



lim inf 



HP) < i 

log log \D\ ~ 2 



The constant \ is far from optimal; any non-zero constant will do for the application 
in section 5 above. We remark that the discriminants D = —p are a very special kind; for 
one thing, they are fundamental discriminants. Therefore, the conclusion of the theorem 
still holds if we weaken our conditions on D, e.g. if it passes to — oo over all discriminants. 

One can prove this theorem under the assumption of GRH by modifying the technique 
of Littlewood ([|3(|), who was the first to prove the analogous result for L(l, D). We will 
present a different proof here based on the methods of |36| and |57|, which has the advantage 
that it is unconditional and that it allows us to infer the theorem over a set as sparse as 
the primes. Shorter proofs are possible, but we present the one here because all of the 



background material is contained in the standard reference ||29|| . Extremal theorems of 
this type are proved in two steps: first one truncates the dirichlet series definition for each 
individual L(s, D), and then averages the remaining finite sum over many D to show the 
theorem in the mean. Since one term must always meet the size of the average, we conclude 
that there are large individual values. We will use sieve methods for each of these steps. 



A.l. Character sums 



The material in this section is modified from ||37 ], chapter 22 



Lemma 1 [Lemma 22.4 of fl37|] Let xiy • • • > XJ be distinct primitive characters to moduli 
< Q, and let a n be arbitrary complex numbers. Then 



J 

£ 



N 



N 



< 



(N + Jc Q\ogQ)J2 



n=l 



for some absolute constant cq > 0. 

We will use Lemma 1 to make a series of estimates. The exponents we use are 
somewhat arbitrary and are hardly optimal. 



Proposition 2 Let Q be large and (logQ) < U < Q , b p G C , |6 P | < 1 for prime p 



29 



Then at most 0(Q 3 ^ 4 ) of the distinct primitive characters x to moduli < Q violate 



E 

U<p<2U 



p 



(A.l) 



Proof: Let xii • ■ ■ > Xj De these violating characters. Let a n be defined by the expan- 



sion 



N 



U<p<2U 



bp lo &(p)x(p) 



n=l 



P 



N = (2U) 1 



Then 



a Pl ... Pr 



J Pl U Pn 



ml 



■(logpi)---(logp m )//(pi,...,p m ) < — (log2£f) m , 
Pi---Pm U m 



where /x(pi, . . . ,p m ) < m ' is the multiplicity from the expansion. Also, 



E 

U<p<2U 



bp log p 



P 



< 



E 



U<p<2U 



logp 

P 



= \og{2U/U) + o(l) < 1 



for large Q. Consequently 



N 



U<p 1 ,...,p rn <2U 



n=l 



\b Pl \- ■■\b Pm \(logpi)-- -(log Pr 

Pi ■ ■ -Pm 



-H(p!,.. .,p m ) 



E 

U<p<2U 



bp logp 



P 



< 1, 



and so 



JV 



N 



71=1 



71=1 



l a »| 2 ^ (max|a n |) ^ M < ^(log2E/") 



By Lemma 1 applied to the exceptional characters xi, . . . , \Ji 
jjj-m/5 < ((2U) m + Jc Q log Q)m\ (^^L 

This estimate will be more than sufficient for our purposes, and in fact we will use ml < m m 
to deduce the weaker 



(777 loff 2U\ 
— u — ) 



(A.2) 



30 



Now we shall set m = 
In Lemma 3 we show that 



11 logQ 
8 log {7 



+ 1, where [•] denotes the greatest-integer function. 



£/-/*> 2c„QlogQp|^ 
holds for large Q. Granted this, (|A.2| ) now implies 

jU-m/5 < 2 ■ (2m\og2U) m 



for large Q. We claim that 



J <2 ( 2mU 1 / 5 log 2U ) = 0(Q 3 / 4 ) 



If this were not so, then for any C > we would have that 



m 



\og(2mU 1/5 \og2U) > - logQ + C. 



Adjusting C, 



1 3 
m log 2 H — m log U + log(m log U) > - log Q + C 
5 4 



mlog2+ ^ili og Q + Iiogf/^J +l og ^logQ + log^ > jjlogQ + C 

27 27 3 

mlog2 + — logQ + log(y logQ) > -logQ + C. 

Readjusting C again slightly, we find 

11 log Q 



and finally 
which is absurd. 



log2 + loglogQ > — logQ + C 
8 log U 40 



11 logQlog2 , , ^ ^ 3 , _ 
T om i n +loglogQ>—logQ + C, 
8 20 log log Q 40 



Lemma 3 Keeping the notation of the above proof, let C be any fixed constant. Then 
for Q sufficiently large, 



U~^> CQ log Q 
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Proof: Otherwise 



which implies 



4ttz 

—— log U < log C + log Q + log log Q + m log(m log 2U) , 
5 



11 4 

— -logQ < (1 + <$)logQ + mlog(mlog2*7) 
8 5 



for some 6 > and Q large. Continuing 



1 



— — 5) log Q < m log(m log 2U) 



< ( lllogQ + 



1 (logtf + 2) 



< 



11 log Q 

8 log 17 ' V "° V V 8 lo S U 
11 log Q V /ll 11 log Q . 

Tioir7 + 1 J log U logQ + logt/+ Tioir7 + 21 - 



Using (logQ) 20 <U< Q 2 , introducing a tiny 5' > 0, and taking Q larger yet, we get 

l 10 J SV ~V 8 20 log log Q J S V4-20 loglogQ 8 SV 

^( 1 + *0^ r ^r^log(7/21ogQ). 
160 log logQ 

This is a contradiction for large Q, because ^ > 



Proposition 4 With the same notation as in Proposition 2 and Lemma 3, but instead 
requiring that U > Q 2 , at most 0(Q 2 ^ 3 logU) distinct primitive characters to moduli < Q 
have 



E 

U<p<2U 



P 



> Q 



-1/3 



Proof: Again, let Xi, • • • , Xj be these characters. By Lemma 1, 

2 



JQ~ 2/S <(U + Jc Q log Q) J2 

U<p<2U 



bp log p 



p 



Since \b p \ < 1, the prime number theorem guarantees that 



E 

U<p<2U 



— logp 

p 



< 



E 



U<p<2U 



(logp) ; 

p2 



o 



2U 



u 



(logo 2 # 



logC/ 
[7 
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Therefore 



or 



JQ" 2 / 3 < C(U + Jc Q log Q) = C x log U + C 2 J Ql °^ Q log U, 



CzU^Q 5 / 3 log Q log t/) < dQ^logU. 

Since U >Q 2 , 

n 5/3 

log Q log < 2Q- 1 / 3 (logQ) 2 



17 

becomes arbitrarily small for large Q, and we conclude 

J = 0(Q 2 / 3 \ogU). 

We will require a form of Dirichlet's theorem on the distribution of primes in residue 
classes. To fix notation we will set li(x) to be the logarithmic integral 

00 dt 



li(x) 
and 



2 



log* 



7r(x, k, I) = #{p < x\p prime, p = I (mod k)}. 
See [^9| for more details on this topic. Siegel and Walfisz proved 

Theorem SW Let e > be fixed. Then there exists a constant c depending on e such 
that 

n(x,k,l) = +0(xexp(-c v / x)) , (k,l) = l 

nq) 

for x > exp k £ . 

Page proved a result which is also valid for smaller x, but with a possible exception 
(related to the Landau-Siegel zero): 

Theorem P Let Q be a positive integer. Then there exists a constant b > such that 
for any modulus k < Q - except for perhaps one exceptional modulus k\ and its multiples 
- we have 

li(x) 



Tt(x,k,l)= + Oix exp(— b\J\og x)) , (k,l) = 1 
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for x > exp (logQ) 2 Q < exp( v /logx). The possible exception ki grows to infinity with 
Q. 

Recall the formula that 

L'(1,X) = _ ^ ^lM! + o(l) 



P 



p prime 

The next proposition shows that one can truncate the sum. 

Proposition 5 For Q large and (logQ) 20 < y < Q 2 , all but 0(Q 11 / 12 ) distinct primitive 
characters x to moduli < Q have 

L'(l,x) ST x(p) 1o SP 



L(1,X) 



p<y 



P 



+ 0(1). 



Proof: Another form of the Siegel-Walfisz theorem gives us that 



L(1,X) 



y: xipn ° gp +o(i). 



p 



p<exp Q 1 / 8 

Given y between (log Q) 20 and Q 2 , we shall dyadically decompose the range from 
(y, exp <2 1/8 ] into intervals (U k ,2U k ], U k = 2 k y < expQ 1/8 . Among these 0(Q 1/8 ) ranges 
we will distinguish those that come before and after the range containing Q 2 , designated 
as (Uko,2Uko\. 

First, for k < ko 



E 

y<p<2 k + 1 y 



X(p) log(p) 



P 



fe 

fc=0 



E 

U k <p<2U k 



X(p) log(p) 



p 



<^(2 fc y)-Vio = 0(2/ -i/io )5 



fc=0 



for characters which satisfy the bound of Proposition 2 in each range. There are at most 
O(Q 3 / 4 (k + 1)) = 0(Q 7 / 8 ) exceptions. 
Secondly, for the k > ko 



E 

2 fc o + 1 <p<expQ 1 / 8 



x(p) logp 



P 



< Q~ 1/3 £ 1 

2 fe y<expQ 1 /8 
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= OtQ-^+VS) = 0(Q" 5 / 24 ) = 0(1) 

with the exception of 

0(Q 2 / 3 l og(2 k+l y)) = 0(Q 2 / 3 (Q 1 /8 ) 2 ) = oiQ 11 / 12 ) 

2 fe y<expQ 1 /8 

characters. 
A 

A. 2. Constructing a set of discriminants 

Having proven Proposition 5, we have now established that if D is a fundamental 
discriminant, then 

L ' (1 ' D) - £ f^+ou) 



L(1,D) ^ \P J P 

except for D in a very sparse set. In fact, we may usually invoke this conclusion for 
fundamental discrimants of the form D = —p, p a prime with p = 3 (4). We shall 
now present a set of candidates for extreme values of L(l, D). While we will not be able 
to directly show that they are large, we will prove an average result which allows us to 
conclude that at least one of them is. 



The candidates: We will pattern ourselves after the argument in ||36|| . Let i be a 
large parameter and set 



Vlogx 



(log logx) 2 

Enumerate the odd primes p%, P3, . . . , p m < y and form the product 

M = 8p 2 p3 ■ ■ -Pm = 4exp(^logp) = exp(y + o(y)). 

p<y 

Not all of the moduli 

M M 
Pm 1 "' Pi 

are divisible by the exceptional modulis h\ mentioned in Theorem P (applied to x). Let 
k = y- be the first of these listed above (i.e. with r the largest) which is not divisible by 
fci; if none exists, then set r = m. The purpose of this construction is to guarantee that 

li(x) 



ir(x, k, I) = -77Y + 0(xexp(-6A/logx)) 
(p[k) 
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for I relatively prime to k. Using the Chinese remainder theorem, we can find two special 
residues l± mod k such that 



±1 , p r ^p<y- 



Define the sets 

S± = {—q \y/x < q < x,q prime, q = l± (mod k), (q, fei) = 1} — E±. 



where E± is the exceptional set of Proposition 5. It follows that the cardinality of S± is 

li(x) 



S± = ^zfy + 0(xexp -by/log x)). For any D G S±, 



L'(1,D) ^ logp_ ^ (D\ logp 

L(1,D) + 4^ p ^ VW P 

= ^ log logs + o(log log*)- £ fR\^E + o(l). (A.3) 

In the next section, we will show that this sum is o(l) on average. 

A.3. Sifting and averaging 

Following , we will use a version of the Renyi sieve to average ( |A.3| ) . 

Lemma R Let Z be a set of Z = \Z\ integers in a range of N consecutive integers, 
f(p) < P and Q(p) > 1 otherwise-arbitrary functions, and set 

r = min ^ P , Q = max Q(p). 

p<Vn P p <Vn 

Let 

Z(p,h) = #{zez\z = h( P )}. 

Then for all but at most 2N Z ® "abnormal" primes < y/N we have that 

Z 



Zip, h)-- 

p 



< 



pQ(p) 

holds except for at most /(p) "irregular" residues. 
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Proof of Lemma R: This follows from Chebyshev's inequality applied to the sieve 
inequality 



p<Q 

(H,p. 158). 



o<Qh=l ^ 



Lemma 6 ( ||36|1 ) Keeping the same notation as Lemma R, assume p is a normal prime. 
Then the number of z G Z in the irregular residue classes mod p is < Z {^-^- + q^j 
Proof of Lemma 6: The number in regular classes exceeds 



p pQ(p)J " " p Q(p)' 

We will always take f(p) = p/(\ogp) 5 and Q(p) = (logp) 5 . 
The average of the sum in (|A.3|) is 

E E (f)^= E Eg)^(p.«- 

Oe5± y<p<(\ogx) 20 XF/ F y<p<(logx) 20 /i=l VF/ ^ 



Lemma 7 This sum, restricted to the normal primes p 7 is 



y<p< (log x) 20 normal h=l 

Proof of Lemma 7: For normal p 



E £(-)— w> = <k* ± : 

±r» ,ht\pJ p 



- — ) 



* E ^ + E w> + E ^ 

fr, regular h irreg. h irreg. 

Q{p) p \Q{p) p J (logp) 5 
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The sum over the normal primes p is thus 

4S± 



< 



E 

p>y 



p(logp) 1 



o(S ± ). 



Lemma 8 The sum in (|A.4j) over the abnormal primes is also o(S±). 

Proof of Lemma 8: First, we demonstrate that there is at most one abnormal 
prime p < exp(y log logx). This is because S±(p,h) = #{-D G S± \ D = h (p)}- By 



Theorem P applied to x and the moduli kp < exp^logx), there is at most one prime 
P < exp(y log logx) which violates 

S±(p, h) = ^- + 0(xexp(-6 v / loi^)) = ^ + 0&)+0(xexp(-b^/h^)). 
p — 1 p p z 

This means p qualifies as a normal prime, because x ~ S±<p(k) logx. 

By Lemma R, the number of abnormal primes p < \fx is 2x ( 1 °s x ) — = 0(4>(k) (logx) 16 ). 
We will bound the sum over the abnormal primes p individually as 



E E 

DeS± y<p< (log x) 20 abnormal 

logy 



D\ logp 



p j p 



<S± 



<s ± 



y 



+ 0[ cp(k)(\ogx] 



y<p< (log x) 20 abnormal 
16 log(j/ log logx) 



logp 

p 



exp(y log logx) 



o(S, 



Proof of Theorem A: Lemmas 7 and 8 show that the average of (|A.3| ) over S± is 
1 ^ L'(1,D) , 1 _ n . 

2^ £7l ]Jj = ±~ log logX + 0(l0g logx). 



At least one term must exceed this average, proving Theorem A. 4jk 
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